A Discrete Optimization Approach to Supervised Ranking

نویسندگان

  • Dimitris Bertsimas
  • Allison Chang
  • Cynthia Rudin
چکیده

In many ranking tasks in machine learning, the goal is to construct a scoring function f:X → R, where X⊂R, that can be used to rank a set of labeled examples {(x�,y�)}��� � , where x� ∈ X and y� ∈ {0,1}, that are randomly drawn from an unknown distribution on X × {0,1}. We present a mixed integer programming (MIP) method to generate this scoring function. In particular, the scoring function is chosen to be a linear combination of features, i.e., f(x�) = wx�, such that the coefficients w are optimal with respect to a ranking quality measure (area under the curve, discounted cumulative gain, P-Norm Push, etc.) on a training set. Other methods for ranking approximate the ranking quality measure by a convex function, whereas our MIP approach is exact. As a result, the proposed approach provides a benchmark against which other methods can be judged. We use datasets from various applications, and show that this novel method performs well on both training and test data, compared with traditional machine learning techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Structural Binary Coding

Binary coding techniques, which compress originally high-dimensional data samples into short binary codes, are becoming increasingly popular due to their efficiency for information retrieval. Leveraging supervised information can dramatically enhance the coding quality, and hence improve search performance. There are few methods, however, that efficiently learn coding functions that optimize th...

متن کامل

A Harmonic Extension Approach for Collaborative Ranking

We present a new perspective on graph-based methods for collaborative ranking for recommender systems. Unlike user-based or item-based methods that compute a weighted average of ratings given by the nearest neighbors, or low-rank approximation methods using convex optimization and the nuclear norm, we formulate matrix completion as a series of semi-supervised learning problems, and propagate th...

متن کامل

A discrete-event optimization framework for mixed-speed train timetabling problem

Railway scheduling is a complex task of rail operators that involves the generation of a conflict-free train timetable. This paper presents a discrete-event simulation-based optimization approach for solving the train timetabling problem to minimize total weighted unplanned stop time in a hybrid single and double track railway networks. The designed simulation model is used as a platform for ge...

متن کامل

A novel technique for a class of singular boundary value problems

In this paper, Lagrange interpolation in Chebyshev-Gauss-Lobatto nodes is used to develop a procedure for finding discrete and continuous approximate solutions of a singular boundary value problem. At first, a continuous time optimization problem related to the original singular boundary value problem is proposed. Then, using the Chebyshev- Gauss-Lobatto nodes, we convert the continuous time op...

متن کامل

Structured Prediction by Conditional Risk Minimization

We propose a general approach for supervised learning with structured output spaces, such as combinatorial and polyhedral sets, that is based on minimizing estimated conditional risk functions. Given a loss function defined over pairs of output labels, we first estimate the conditional risk function by solving a (possibly infinite) collection of regularized least squares problems. A prediction ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010